Constrained Proper Sampling of Conformations of Transition State Ensemble during Protein Folding
نویسندگان
چکیده
Characterizing the conformations of protein in the transition state ensemble (TSE) is important for studying protein folding. A promising approach pioneered by Vendruscolo et al40 to study TSE is to generate conformations that satisfy all constraints imposed by the experimentally measured φ-values that provide information about the native-likeness of the transition states. Faisca et al12 generated conformations of TSE based on the criterion that, starting from a TS conformation, the probabilities of folding and unfolding are about equal through Metropolis Monte Carlo (MC) simulations. In this study, we use the technique of constrained sequential Monte Carlo method (CSMC)26,44 to generate TSE conformations of acylphosphatase (AcP) of 98 residues that satisfy the φ-value constraints, as well as the criterion that each conformation has a folding probability of 0.5 by Metropolis MC simulations. We adopt a two stage process and first generate 5,000 contact maps satisfying the φ value constraints. Each contact map is then used to generate 1,000 properly weighted conformations. After clustering similar conformations, we obtain a set of properly weighted samples of 4,185 candidate clusters. Representative conformation of each of these cluster is then selected and 50 runs of Markov chain Monte Carlo (MCMC) simulation are carried using a regrowth move set. We then select a subset of 1,501 conformations that have equal probabilities to fold and to unfold as the set of TSE. These 1,501 samples characterize well the distribution of transition state ensemble conformations of acylphosphatase. Compared with previous studies, our approach can access much wider conformational space and can objectively generate conformations that satisfy the φ-value constraints and the criterion of 0.5 folding probability without bias. In contrast to the previous studies, our results show that transition state conformations are very diverse and are far from native-like when measured in cRMSD (Cartesian Root-Mean-Square Deviation): the average cRMSD between TS conformations and the native structure is 9.4Å for this short protein, instead of 6Å reported in previous studies. In addition, we found that the average fraction of native contacts in the TSE is 0.37, with enrichment in native-like β-sheets and a shortage of long range contacts, suggesting such contacts form at a later stage of folding. We further calculate the first passage time of folding of TSE conformations through calculation of physical time associated with the regrowth moves in MCMC simulation through mapping such moves to a Markovian state space model with transition time obtained by Langevin dynamics simulations. Our results indicate
منابع مشابه
Reconstruction of the src-SH3 protein domain transition state ensemble using multiscale molecular dynamics simulations.
We use an integrated computational approach to reconstruct accurately the transition state ensemble (TSE) for folding of the src-SH3 protein domain. We first identify putative TSE conformations from free energy surfaces generated by importance sampling molecular dynamics for a fully atomic, solvated model of the src-SH3 protein domain. These putative TSE conformations are then subjected to a fo...
متن کاملNature of the transition state ensemble for protein folding
The ability of a protein to fold rapidly to its unique native state from any of its possible unfolded conformations depends critically on the transition state for folding. A variety of different physical characterizations of this ensemble have been proposed, spanning a spectrum from common partial structures to a broad collection of relatively unrelated conformations. Here we determine and comp...
متن کاملKinetic definition of protein folding transition state ensembles and reaction coordinates.
Using distributed molecular dynamics simulations we located four distinct folding transitions for a 39-residue betabetaalphabeta protein fold. To characterize the nature of each room temperature transition, we calculated the probability of transmission for 500 points along each free energy barrier. We introduced a method for determining transition states by employing the transmission probabilit...
متن کاملPathways for protein folding: is a new view needed?
Theoretical studies using simplified models of proteins have shed light on the general heteropolymeric aspects of the folding problem. Recent work has emphasized the statistical aspects of folding pathways. In particular, progress has been made in characterizing the ensemble of transition state conformations and elucidating the role of intermediates. These advances suggest a reconciliation betw...
متن کاملLong-range order in the src SH3 folding transition state.
One of the outstanding questions in protein folding concerns the degree of heterogeneity in the folding transition state ensemble: does a protein fold via a large multitude of diverse "pathways," or are the elements of native structure assembled in a well defined order? Herein, we build on previous point mutagenesis studies of the src SH3 by directly investigating the association of structural ...
متن کامل